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CRISPR (Clustered, Regularly, Inter- 
spaced, Short, Palindromic Repeats) 
loci have been shown to provide prokar- 
yotes with an adaptive immunity against 
viruses and plasmids. CRISPR arrays are 
transcribed and processed into small 
CRISPR RNA molecules, which base- 
pair with invading DNA or RNA and 
lead to its degradation by CRIS PR- 
associated (Cas) protein complexes. 
New spacers can be acquired by active 
CRISPR/Cas systems, and thus the 
sequences of these spacers provide a 
record of the past "infection history" of 
the organism. Recently we used spacer 
sequences from archaeal genomes to infer 
gene exchange events among archaeal 
species and genera and to demonstrate 
that at least in this domain of life 
CRISPR indeed has an anti-viral role. 



In most known ecosystems, bacteria and 
archaea are under constant attack from 
viruses and the impact of this parasitism 
can often shape microbial assemblages. 1 
Prokaryotes have evolved several defense 
systems to protect themselves from viruses, 
allowing for some long-term co-existence 
of viruses and their hosts. Perhaps the 
most elaborate defense system identified 
to date, CRISPR (Clustered, Regularly, 
Interspaced, Short, Palindromic Repeats)/ 
Cas (CRISPR-associated), provides acquired 
and heritable immunity to prokaryotes 
against parasitic selfish elements, such as 
viruses. 2 " 4 These systems contain arrays of 
repeated sequences interspersed by short 
non-repetitive DNA segments (20-50 bp 
long), termed spacers. 5 The spacer array 
can be transcribed and processed into 
small CRISPR RNA (crRNA) molecules. 



These crRNA molecules base-pair with 
invading nucleic acids and mark them for 
degradation. 6 7 The fact that the CRISPR/ 
Cas systems are able to continuously 
acquire new spacers allows the study of 
the history of past infections, as has been 
shown in acidic biofllms. 8,9 

Lately, the primary role of CRISPR as 
an anti-viral defense system, at least in 
Enterobacteriaceae, has been questioned 
by a study showing lack of acquisition 
of new spacers in Escherichia coli strains 
that diverged in the last 250,000 years and 
certainly exposed to numerous bacterio- 
phages. 10 Furthermore, it was suggested 
that some CRISPR systems are actually 
specialized to have an exclusively anti- 
plasmid role. 10 We recently surveyed all 
archaeal spacers, using a BLASTN-based 
pipeline, in order to gain insights into 
which foreign DNA they come into con- 
tact with and whether anti-viral activity is 
their major function. 11 

About 50% of spacers that had high 
confidence database matches in our study 
appear to target mobile genetic elements. 
Over 40% of the matches were to viruses 
(including proviruses), implying that for 
archaea, anti-viral defense is probably the 
major known function of CRISPR/CAS 
systems. Further support for this notion 
was obtained when we examined the 
location of anti-viral spacers within the 
CRISPR array. Freshly-acquired spacers 
are incorporated next to the leader se- 
quence of the CRISPR/Cas system, while 
older spacers are found toward the other 
end of the array. Newly acquired spacers 
will not have had time to undergo 
purifying selection, since there has not 
been sufficient time for mutations to 
accumulate. In contrast, older spacers can 
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accumulate mutations and be lost unless 
they are under selective pressure for their 
retention. This implies that over longer 
periods of time only spacers that are 
advantageous to the host organism would 
persist. If anti-viral spacers are indeed the 
most beneficial to the host one would 
expect enrichment for virus-matching 
spacers compared with other spacers 
toward the end of the array. Indeed, 
anti-viral spacers were significantly more 
distal on average, further supporting 
the importance of the anti-viral role of 
CRISPR/Cas in archaea. There were 
many instances where spacers from one 
genus matched a virus known to infect 
another, including 30 CRISPR spacers 
of Metallosphaera sedula (family Sulfolo- 
baceae) that matched the genome of the 
Acidianus two-tailed virus. 12 ' 13 Thus, some 
archaeal viruses may infect a relatively 
broad host range. 

Viruses were by no means the only 
mobile element matching archaeal spacers 
—8.3% and 8.9% of hits matched 
plasmids and transposable elements, 
respectively. Strikingly, nearly 18% of 
the database hits were to CRISPR/CAS 
genes. This observation is somewhat 
expected since CRISPR loci themselves 
are often encoded on plasmids. We 
speculated that having one CRISPR/Cas 
system decreases the chances of acquiring 
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